TorchAO compile + offloading tests #11697

a-r-r-o-w · 2025-06-11T22:57:45Z

No description provided.

HuggingFaceDocBuilderDev · 2025-06-11T23:04:37Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

sayakpaul

Thank you! Appreciate the detailed comments.

sayakpaul · 2025-06-16T10:57:44Z

tests/quantization/test_torch_compile_utils.py

@@ -64,7 +64,29 @@ def _test_torch_compile_with_cpu_offload(self, quantization_config, torch_dtype=
            # small resolutions to ensure speedy execution.
            pipe("a dog", num_inference_steps=3, max_sequence_length=16, height=256, width=256)

-    def _test_torch_compile_with_group_offload(self, quantization_config, torch_dtype=torch.bfloat16):
+    def _test_torch_compile_with_group_offload_leaf(self, quantization_config, torch_dtype=torch.bfloat16):


Maybe we can test with parameterized where we test with and without streams?

tests/quantization/torchao/test_torchao.py

sayakpaul

Thanks!

a-r-r-o-w · 2025-06-17T04:14:31Z

@sayakpaul I'm not sure what's causing the tests to fail 🤔 This PR guards the compile test with torchao version/installation requirement but still seemingly causes tests to fail. I'll try to take a look later today if we don't have a quick understanding of what happened here

sayakpaul · 2025-06-17T04:23:13Z

Exactly! Nothing comes to mind as to what could trigger this!

sayakpaul

Was able to spend some time and the following diff solves the problem:

Expand

diff --git a/tests/quantization/torchao/test_torchao.py b/tests/quantization/torchao/test_torchao.py
index ddf97aca5..28454aae9 100644
--- a/tests/quantization/torchao/test_torchao.py
+++ b/tests/quantization/torchao/test_torchao.py
@@ -631,11 +631,14 @@ class TorchAoSerializationTest(unittest.TestCase):
 
 @require_torchao_version_greater_or_equal("0.7.0")
 class TorchAoCompileTest(QuantCompileTests):
-    quantization_config = PipelineQuantizationConfig(
-        quant_mapping={
-            "transformer": TorchAoConfig(quant_type="int8_weight_only"),
-        },
-    )
+    @property
+    def quantization_config(self):
+        config = PipelineQuantizationConfig(
+            quant_mapping={
+                "transformer": TorchAoConfig(quant_type="int8_weight_only"),
+            },
+        )
+        return config
 
     def test_torch_compile(self):
         super()._test_torch_compile(quantization_config=self.quantization_config)

ChatGPT does a nice job of explaining what is happening:
https://chatgpt.com/share/685951bc-7c88-8013-b317-62683d1a1fa9. What I didn't investigate is that how come the other TorchAO tests are not getting flagged because of torchao installation errors 🤷

sayakpaul · 2025-06-23T12:41:34Z

tests/quantization/test_torch_compile_utils.py

        torch._dynamo.config.cache_size_limit = 10000

        pipe = self._init_pipeline(quantization_config, torch_dtype)
        group_offload_kwargs = {
            "onload_device": torch.device("cuda"),
            "offload_device": torch.device("cpu"),
            "offload_type": "leaf_level",
-            "use_stream": True,
-            "non_blocking": True,


Should keep the non_blocking=True or make it an argument of the function like use_stream?

update

38c213f

a-r-r-o-w added 3 commits June 16, 2025 08:15

Merge branch 'main' into torchao-compile-tests

8173a29

update

fb99d94

update

b69d099

a-r-r-o-w marked this pull request as ready for review June 16, 2025 08:51

a-r-r-o-w requested a review from sayakpaul June 16, 2025 08:52

sayakpaul approved these changes Jun 16, 2025

View reviewed changes

a-r-r-o-w added 2 commits June 16, 2025 22:57

update

2c608d1

update

acd86ed

sayakpaul approved these changes Jun 17, 2025

View reviewed changes

sayakpaul added performance Anything related to performance improvements, profiling and benchmarking torch.compile labels Jun 18, 2025

Merge branch 'main' into torchao-compile-tests

6d5f77e

sayakpaul reviewed Jun 23, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

TorchAO compile + offloading tests #11697

TorchAO compile + offloading tests #11697

Uh oh!

a-r-r-o-w commented Jun 11, 2025

Uh oh!

HuggingFaceDocBuilderDev commented Jun 11, 2025

Uh oh!

sayakpaul left a comment

Uh oh!

sayakpaul Jun 16, 2025

Uh oh!

Uh oh!

sayakpaul left a comment

Uh oh!

a-r-r-o-w commented Jun 17, 2025

Uh oh!

sayakpaul commented Jun 17, 2025

Uh oh!

sayakpaul left a comment •

edited

Loading

Uh oh!

sayakpaul Jun 23, 2025

Uh oh!

Uh oh!

TorchAO compile + offloading tests #11697

Are you sure you want to change the base?

TorchAO compile + offloading tests #11697

Uh oh!

Conversation

a-r-r-o-w commented Jun 11, 2025

Uh oh!

HuggingFaceDocBuilderDev commented Jun 11, 2025

Uh oh!

sayakpaul left a comment

Choose a reason for hiding this comment

Uh oh!

sayakpaul Jun 16, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

sayakpaul left a comment

Choose a reason for hiding this comment

Uh oh!

a-r-r-o-w commented Jun 17, 2025

Uh oh!

sayakpaul commented Jun 17, 2025

Uh oh!

sayakpaul left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

sayakpaul Jun 23, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

sayakpaul left a comment •

edited

Loading